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Abstract. We analyze the wide angle x-ray scattering from oriented spider silk fibers in terms of a quan- 
titative scattering model, including both structural and statistical parameters of the /3-sheet crystallites 
of spider silk in the amorphous matrix. The model is based on kinematic scattering theory and allows for 
rather general correlations of the positional and orientational degrees of freedom, including the crystal- 
lite's size, composition and dimension of the unit cell. The model is evaluated numerically and compared 
to experimental scattering intensities allowing us to extract the geometric and statistical parameters. We 
show explicitly that for the experimentally found mosaicity (width of the orientational distribution) inter- 
crystallite effects are negligible and the data can be analyzed in terms of single crystallite scattering, as is 
usually assumed in the literature. 
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PACS. 81.07.Bc Nanocrystalline materials - 87.15.- 
61.05.cc Theories of x-ray diffraction and scattering 



1 Introduction 



Spider silk is a material which is since long known to ev- 
erybody, but which only more recently receives great ap- 
preciation by the scientific community for its outstanding 
material properties (jT|; 0) ■ Interest here has focused on the 
so-called dragline fiber, the high strength fibers which orb 
web spiders produce from essentially only two proteins to 
build their net's frame and radii, and also to support their 
own body weight after an intentional fall down during es- 
cape. Evolution has optimized dragline fibers for tensile 
strength, extensibility and energy dissipation. Dragline 
silk can support relatively large strains and has a ten- 
sile strength comparable to steel or Kevlar. For the en- 
ergy density which can be dissipated in the material be- 
fore breaking, the so-called tenacy (toughness), values of 
160MJ/m 3 have been reported (0; 0), e.g. for different 
Nephila species, on which most studies have been carried 
out. An understanding of the structural origins of these 
mechanical properties is of fundamental interest, and may 
at the same time serve the development of biomimetic 
material design (H @), using recombinant and synthetic 
approaches (0; HHs). As for other biomaterials, the cor- 
relation between structure and the mechanical properties 
can only be clarified by advanced structural characteri- 
zation accompanied by numerical modelling. To this end, 
not only the mechanical properties (flU [TTh resulting from 
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the structure, but also the structure itself has to be mod- 
elled to exploit and to interpret the experimental data. 
Such efforts have in the past led to a quantitative un- 
der standingof many biomaterials like bone, tendons and 
wood (H3;|ll). 

As deduced from x-ray scattering 

Q H HI M) and 

NMR experiments (fl7l ) , spider silks are characterized by a 
seemingly rather simple design: the alanine-rich segments 
of the fibroin polypeptide chain fold into /3-sheet nano- 
crystallites (similar to poly-L-alanine crystals) embedded 
in an amorphous network of chains (containing predomi- 
nately glycine). The crystalline component makes up an 
estimated 20%-30% of the total volume, and may repre- 
sent cross-links in the polymer network, interconnecting 
several different chains. At the same time the detailed in- 
vestigation of the structure is complicated, at least on the 
single fiber level, by the relatively small diameters in the 
range of 1 — 10 /im, depending on the species. Using highly 
brilliant microfocused synchrotron radiation, diffraction 
patterns can be obtained not only on thick samples of fiber 
bundles, but also on a single fiber (0; [l§ HO; EH; [13; HI) . 
Single fiber diffraction was then used under simultaneous 
controlled mechanical load in order to investigate changes 
of the molecular structure with increasing strain up to 
failure (pil l. Note that single fiber diffraction, where pos- 
sible, is much better suited to correlate the structure to 
controlled mechanical load, since the strain distribution in 
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bundles is intrinsically inhomogcncous, and the majority 
of load may be taken up by a small minority of fibers. 

While progress of the experimental diffraction studies 
has been evident, the analysis of the data still relies on the 
classical classification and indexing scheme introduced by 
Warwicker. According to Warwicker, the /3-sheet cristal- 
lites of the dragline of Ncphila fall into the so-called system 
3 of a nearly orthorhombic unit cell (flil ; |25T ) with lattice 
constants 10.6 x 9.44 x 6.95 A (0). To fix the coordinate 
system, they define the x-axis to be in the direction of the 
amino acid side chains connecting different /3-shects with 
a lattice constant of a x = 10.6 A while the j/-axis denotes 
the direction along the hydrogen bonds of the /3-sheets 
with a lattice constant of a y = 9.44 A. Finally the z-axis 
corresponds to the axis along the covalent peptide bonds 
(main chain) with a lattice constant a z = 6.95 A. The 
z-axis with small lattice constant is well-aligned along the 
fiber axis. Note that while we follow this commen conven- 
tion, other notations and choices of axis are also used in 
the literature. While helpful, the indexing scheme docs not 
give any precise information on the exact structure of the 
unit cell, and on the fact whether the /3-pleatcd sheets are 
composed of parallel or antiparallcl strands, and how the 
two-dimensional sheets are arranged to stacks. To this end, 
not only peak positions but the entire rather broad inten- 
sity distribution has to be analyzed. To interpret the scat- 
tering image it is essential to know, whether correlations 
between different crystallites are important or whether the 
measured data can be accounted for by the scattering of 
single crystallites, averaged over fluctuating orientations. 
It is also not clear, whether correlations between trans- 
lational and rotational degrees of freedom are important. 
Finally, the powder averaging taking into account the fiber 
symmetry experimental mosaicity (oricntational distribu- 
tion) must be quantitatively taken into account. 

In this work we built a scattering model based on kine- 
matic scattering theory and compare the numerically cal- 
culated scattering intensity with the experimental wide 
angle scattering distribution measured from aligned silk 
fibers. The numerical calculations allow for a quantita- 
tive comparison to the experimental data and yield both 
structural and statistical parameters. Note that the small 
size of crystallites, leading to correspondingly broad re- 
flections, and a generally rather low number of external 
peaks exclude a standard crystallographic approach. The 
structural parameters concern the crystal structure, in 
particular the atomic positions in the unit cell, and the 
crystallite size. The composition is assumed to be that of 
ideal polyalanine without lattice defects. This assumption 
is partly justified by the fact that the small size of the 
crystallite is the 'dominating defect' in this material. The 
statistical parameters relate to the orientational distribu- 
tion of the crytallite symmetry axis with respect to the 
fiber axis and the correlations between crystallites. The 
model is constructed based on a quite general approach, 
allowing independently for correlations between center-of- 
mass positions (translations) and crystallite orientations 
(rotations). 



The paper is organized as follows. In Sec. [2] we in- 
troduce the basic model with parameters for the crystal- 
lite size, lattice constants and statistical parameters for 
the crystallites' position and orientation. Subsequently, in 
Sec. [3] we compute the scattering function for our model. 
Sec. |4] specifies the different atomic configurations which 
are conceivable for polyalanine. The main results and the 
comparison of calculated and measured intensities are pre- 
sented in section [5l before the paper closes with a short 
conclusion. 



2 Model 

In this section we present a simple model of spider silk 
which allows us to compute the scattering function 



(1) 



as measured in X-ray scattering experiments. The atomic 
positions are denoted by rj and the atomic form factors 
by fj. The modeling proceeds on three different levels: 1) 
On the largest lcngthscales spider silk is modelled as an 
ensemble of crystallites embedded in an amorphous ma- 
trix and preferentially oriented along the fiber axis. 2) 
Each crystallite is composed of parallel or antiparallcl /3- 
sheets. 3) Each unit cell contains a given number of amino 
acids, whose arrangements have been classified by War- 
wicker (flit). In Fig. ([TJ bottom) we show an illustration of 
Bombyx mori by Geis |26l). 

In the following we shall build up a model, starting on 
the smallest scales and working up to the whole system. 
Subsequently we will compare our calculated scattering 
functions with experimental data. Thereby we are able to 
determine the arrangement of atoms in the unit cell which 
optimizes the agreement between model and experiment. 



2.1 Unit cell 

One unit cell of a crystallite is described as a set of atoms 
at positions Tk relative to the center of the unit cell, where 
k = 1,2,..., if runs through the atoms of the unit cell 
(see Fig. [TJ top panel, for a schematic drawing). Each 
atom is assigned a form factor specifying the scat- 
tering strength of the respective atom type. 



2.2 Crystallite 

A crystallite is composed of M = M x M y M z unit cells, 
replicated M x , M y and M z times along the primitive vec- 
tors a. x , SL y and a z , respectively. The unit cells in a crys- 
tallite are numbered by a vector index m = (m x , m y , ro z ), 
where m v = 1,2, ...,M V for v = x,y,z. Hence the center 
of mass of unit cell m has position vector 
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Fig. 1. Top: Schematic view of the unit cell. Atom k is loacted 
at position rt, and the atom type is specified by the form factor 
fk of the atom. For simplification schematic illustrations are 
in 2D, if possible, even though the model refers to three space 
dimensions, D = 3. Bottom: Possible configuration inside the 
unit cell (illustration adapted from H^)). 

Actually it is more convenient to measure all distances 
with respect to the center of the whole crystallite 

(M x + l)a, + (M y + IX + (M x + IK 

Scm - 2 

so that s m = s m — s cm denotes the position of unit cell m 
relative to the center of the crytallite to which it belongs 
(see Fig. 02) and the position of atom k in unit cell m 
relative to the center of the crystallite is 

r m .fc = s m + r fe (2) 



2.3 Ensemble of Crystallites 

The whole system is composed of N such crystallites at 
positions with j = 1, 2, ...N (see Fig. 03). The crystal- 
lites are not perfectly aligned with the fiber axis, instead 
their orientation fluctuates. The orientation of a single 
crystallite is specified by three Euler angles 4>^\ tp^ 
(see Fig. 02 bottom). Here we have chosen the z-axis as the 
fiber axis and 9 denotes the angle between the z-dircction 
of the crystallites (direction of covalent bonds) and the 
fiber axis. The atomic positions of the rotated crystal- 
lite are obtained from the configuration which is perfectly 
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Fig. 2. Schematic view of the crystallite. The vector s m goes 
from the center of the crystallite to the unit cell m. From there 
r-fc goes to atom k. 

aligned with the z-axis by applying a rotation matrix Dy' 
(see Fig. Efl top): 

rmi fc = RW +£ (j) rm, fc . (3) 



z 




X 



Fig. 3. The whole system is composed of crystallites at posi- 
tions R 0) . They can be rotated by rotation matrices D} J >. 

In the experiment the scattering intensity is obtained 
for a large system, consisting of many crystallites. Hence 
it is reasonable to assume that the scattering function is 
self- averaging and hence can be averaged over the posi- 
tions and orientations of the crystallites. We use angular 
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Fig. 4. Top: The position of atom k in unit cell m of the ro- 
tated crystallite is obtained by applying the rotation matrix 
D]Cm,k to the position vector in the aligned configuration. Bot- 
tom: Illustration of Euler angles in 3D. The angle 9 specifies 
the deviation of the crystallite's a z -axis from the fiber axis of 
the strand. <p is the rotation of the crystallite about it's own 
a z -axis, and ip is the rotation of the a z -axis about the fiber 
axis (after the ^-rotation). 

brackets ( ) to denote the average of an observable O over 
crystallite positions R^ and orientations D^': 



(O) 



r N 

/ [] (a*R^V^) P po8 (R( 1 >,..,RW)0. 



(4) 



Here, the crystallite positions follow the distribution func- 
tion P pos (R' 1 \ ...,W- N >) which in general includes corre- 
lations. In contrast the orientation of each crystallite is 
assumed to be independent of the others. The average 
over all orientations 



VD {i) = dcp^dO^dip^ sin6 »(i) P anglc (0(^,6» (j) , 



involves the angular distribution function Tangle (</>, 0, ip), 
which is the same for each crystallite. In the simplest 
model we assume a Gaussian distribution for the devia- 
tions of the crystallite axis from the fiber axis 'P a ngie(</ ) , 0, ip) 

~ exp(— J™) while all values of (p and ip between and 2tt 



are equally likely. 



2.4 Continuous background 

The space between the crystallites is filled with water 
molecules and strands connecting the crystallites, which 
are called amorphous matrix. In the scope of this work, 
we are not interested in the details of its structure and 
model it as a continuous background density go, chosen 
to match the average scattering density of the crystallite 
(see Fig. [5]): go = J2k=i fk/V uc . Here V uc is the volume of 
the unit cell. Inside the crystallite there is no background 
intensity which is achieved in our model by cutting out 

For 



a spherical cavity, V(r), around each atom r 
simplicity we assume a Gaussian cavity 



.0') 

'm,fc 



V(v) 



f 



(27r£ 2 ) 3 / 2 



cxp 



2e 



(6) 



and choose the amplitude such that the average density 
inside the crystallites is zero: 



d 3 7 



M K 



crystallite 



fe=i 



r m .k) 



(7) 



where the sum over the vector index 5Z m means 

E»"=iE!lt=r Witri tms assumption / = J2k=ifk/ K 
is simply the average form factor. The typical size of the 
cavity £ has to be comparable to the nearest neighbour 
distance to make sure that there is no "background" in- 
side the crystallites. Models with and without continuous 
background are compared in Appendix |XJ 

This completes the specification of our model and we 
proceed to compute the scattering function as predicted 
by the model. 



3 Scattering function 

Given the atomic positions v^ k , the background density 
g(r) and the statistics of the crystallites' orientations and 
positions, we calculate the scattering function: 

G(q) = 

N M K r 

f k ex P(^ r mlfc) + / d3r e( r ) exp(iqr) 

A 1 -m 7„ 1 ** 



j — 1 m k=l 



(5) 



(8) 
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• atoms 

- background 



Fig. 5. Illustration of the continuous background, modelling 
the amorphous matrix. Outside the crystallites it has a homo- 
geneous scattering density go, which matches the mean scat- 
tering density of the crystallites. In this illustration £ is 0.45 
times the mean nearest neighbour distance. 



Here g(r) is the background intensity whose Fourier trans- 
form reads: 



J d 3 r g(r) cxp(iqr) 



N M K 

d 3 rcxp( iq r) | ft-/EEE y ( r - r ^) 

j=l m fc=l 
N M K 

= QoVS^o - V(q) £ £ £ exp(iqrg fc ) (9) 

j=l m k=l 



The uniform density, giving rise to a contribution pro- 
portional to <5 q ,o, does not contain information about the 
structure of the system. Furthermore the central beam has 
to be gated out in the analysis of the experimental data. 
Hence we neglect the uniform contribution and obtain for 
the scattering intensity © 



G(q) 



N M K 



L m,fc, 



(10) 

The cavities give rise to effective form factors Fk(q) = 
fk — y{f\) accounting for the scattering of the atoms them- 
selves, /fc, and the cavities around them, V'(q). Note that, 
if we want to switch off the background / — > 0, we return 
to the original form factors -Fi-(q) — > fk- 



Inserting the average ( ) and multiplying out the mag- 
nitude squared in (fl"Q"|) yields: 



G (*)=f ft {d 3 R {j) m {j) ) ^p OS (R (1) ,...,R (Ar) ) 

N 

£ exp(iq(R« -R (/) )) 



M K 



X 



J2 £ ^(q)^'(q) 



in . in ' k,k'—l 

x exp(iq(^r m , fe -D^r m%fc 0) (11) 

Note, that the double sum over the crystallites j and f 
also applies to the rotation matrices D}^ and Dy ' in the 
second line. We now split the scattering function in two 
terms G(q) = Gi(q) + G2(q): The first one, Gi(q), only 
includes the terms j = j' of that sum and thus incorpo- 
rates scattering of the same crystallite, but not scattering 
from different crystallites. The second one, G2(q), only in- 
cluding the terms with j ^ j', takes into account coherent 
scattering of two different crystallites. 

3.1 Incoherent part 

We first consider the case j = j' of the sum, i.e. the 
contribution to the scattering function which is incoher- 
ent with respect to different crystallites. Here, the term 
exp(iq(R« - rC*'))) gives 1, and therefore the integral 
over the crystallite positions R'-?) can be performed and 
trivially gives 1 due to the normalization of the spatial dis- 
tribution V pos (R {1 \ ..,RC!). Further more, for each sum- 
mandj, all integrations over the orientations J2 i ■ ■■■,12. \ 
except Dy', can be performed and also yield 1. Therefore, 
the N terms with j = j' simplify to: 

„ M K 

G 1 (q)=N JmH £ *Uq)*£(<i) 

II i. in 7 — 1 

x exp(iq(|2r mife - Qx m ' ,k')) 
„ MP 
= N VR ££i 71 fc(q)cxp^(£ T q) ■ r m , fe ) 

•* m p=l 

= N J P^|A(£ T q)| 2 , (12) 



where D. is the transpose of the matrix D_ and 



M K 



M^) := £ £ cxp(iq ■ r mifc ) 



(13) 



m fc=l 



is the unaveraged scattering amplitude of a single unro- 
tated crystallite. Hence the interpretation of the incoher- 
ent part Gi(q) is straightforward: Each crystallite con- 
tributes independently, each with a given orientation. The 
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orientation can be absorbed in the scattering vector q — > 
_D T q. so that the contributions of two crystallites with dif- 
ferent orientations are simply related by a rotation of the 
scattering vector. In the macroscopic limit we are allowed 
to average over all possible orientations and get a sum of 
N identical terms. 



3.2 Coherent part 

We now consider the contribution to the scattering in- 
tensity from different crystallites, i.e. the case j ^= j' 
in Eq. lfTTj) . In analogy to the above calculation all inte- 
grations over the orientations D. , ■■■,D} N \ can DC per- 
formed except for and Dy ' : 

G 2 (q) = J d 3 R^ ■ • -d 3 i?W P pos (R (1) , -,R (JV) ) 
x ^exp(«q(R« -R^)] 



iri.rn 7 k,k' — l 

x cxp(iq(^r mifc -£'rm',k')) 



(14) 



where we have used that the angular distribution is the 
same for all crystallites. We introduce the structure factor 
of the crystallite positions: 



1 / N 

Siq ) = N\ £ exp(iq(R«-R>">) 



1 + i ffa 



.d 3 i?w^ P os(R (1) ,...,R w ) 



x cxp^q(R (j) - R.V ))J (15) 



and observe that the two upper lines in (|14|) are just 
N (5(q) — 1). Hence G 2 can be simplified to: 

G 2 (q)=7V(S(q)-l) 

. M K 

J V S,HY, Fk ^ ex p(*(£fq) ■ r m,fc) 



7V(S(q) 



m k=l 
1 



J V]2A(]2 T ci) 



(16) 



Again the interpretation is straightforward: For coherent 
scattering the amplitudes of individual crystallites with 
different orientations add up, as expressed by J 'DD.^D^ q) 
Spatial correlations of the centers of the crystallites arc 
accounted for by the structure function. 
The total scattering function 



G(q) 

N 



J ^|^U2 T q)f + (S(q) -1) / VQAilfq) 



is reduced to the scattering amplitude of a single crytallite 
A(q), which we compute next. Note that if the angular 
spread of the crystallites can be neglected, in other words 
all crystallites are approximately aligned, then the above 
expression reduces to G(q) = NS(q) |A(q)| 2 , as one would 
expect. 

If there is thermal motion of the atoms around their 
equilibrium positions due to finite temperature, the inten- 
sity of the scattering function G(q) is weakened for larger 
g-values (e.g.l27j). The resulting scattering function has to 
be multiplied with the Debye- Waller factor 



G DW (q) = G(q) • exp(-qV)) . 



(18) 



where (u 2 ) is the mean square displacement of the atoms 
in any direction. 



3.3 Scattering amplitude of a single crystallite 

The calculation of the scattering amplitude of a single 
crystallite A(q) (defined in Eq. [T3|) follows standard pro- 
cedures. We substitute the atomic positions of Sec. 12.21 
and note that the sums over m x , m y and m z are geomet- 
ric progressions which can easily be performed, yielding: 

K 

A{q) = L Mv (qa x )L My (qay)L Mz (qa. z )^2 F k (q) cxp(iqr k ) 

k=l 

(19) 

Here L^[ v {qa v ) = b ™^^^ I /2)^ * s the well-known Laue 
function, which has an extreme value, when its argument 
qa„ is a multiple of 2tt. It is noteworthy, however, that the 
position of the extremum of the magnitude of the scatter- 
ing amplitude A(q) may be shifted, if the form factor of 

the unit cell Ylk=i -^Mq) cx P(*q r fe) nas a non-vanishing 
gradient at that position and M v is finite (and there- 
fore the peak width of the Laue function is nonzero). In 
this case the resultant peak may be shifted by a value 
of the order of its peak width. Consequently care has to 
be taken, when determining lattice constants from exper- 
imental peak positions. 



4 Atomic configuration of the unit cell 



The computation of the scattering function G(q) requires 
the atomic configuration {rfcjfLi of the unit cell, which 
we discuss next. 



(17) 



4.1 Unshifted unit cells 



It is known that the crystallites are composed of poly- 
alanine strands (see (|28l : [it) ) . In Fig. [Bl two alanine amino 
acids of this strand are shown. Its conformation, shown 
on the right side, is well established and was created with 
Yasara (|29r ). There are two constraints for the strand: 
First, the subsequent alanines in the strand must have 
the same orientation so that the strand does not have 
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a "twist" and can produce periodic structures. Second, 
the distance between adjacent alanines has to match the 
size of the unit cell. These two constraints allow for a 
unique choice of the two degrees of freedom, namely the 
Ramachandran angles |30l ) <P and W which are the dihedral 
angles for the bonds C a -N and C a -C, respectively. 





C = 


O 


H- 


1 


CH 




N- 


-H 


O 


= C 




H 3 C- 


1 

-c a - 


-H 


H- 


1 

-N 






Fig. 6. (Color online) Chemical structure (left) and conforma- 
tion (right) of a poly-alanine strand. Two alanines are shown. 
The CH3 group is characteristic for the alanine amino acid 
and is bound to the so called C a atom. The arrow indicates 
the direction C — > C a — > N of the backbone. 



Many of the described poly-alanine strands side by side 
form a stable crystalline configuration, the (3 -pleated- sheet. 
We assume an orthorombic unit cel{]] , consisting of four 
alanine strands, as illustrated in Fig. [7j Thus one unit cell 
contains 8 alanine amino acids. Furthermore we define the 
spatial directions in the usual way (e.g. Ihll): 

x: Direction of the CH3-groups and Van-der-Waals inter- 
actions between sheets lying upon each other. 

y: Direction of the hydrogen bonds between the O-atom 
of one strand and the H-atom of the neighboring strand. 

z: Direction of the covalcnt bonds along the backbone. 

Accordingly, & x , a. y and a z are the principal vectors point- 
ing in these directions and a x , <%, a z their magnitudes. 
Note furthermore that due to symmetry the distance be- 
tween the strands has to be a x /2 in the x-direction and 
a y /2 in the y-direction. 

In general one has to distinguish between the parallel 
and antiparallel structure. In the parallel structure the di- 
rection of the atom sequence C — > C a — ► N in the strand's 
backbone is the same for all strands (left side of Fig. [7J. 
For the antiparallel structure, this direction is alternating 
along the y-axis (right side of fig.[jj. Both will be consid- 
ered in the following analysis. 



We note that the assumption of an orthorhombic unit cell 
is restrictive. In a more general approach one can give up this 
assumption and use a smaller unit cell allowing for different 
shifts. The best fit to the data is obtained for a shift which can 
also be achieved with an orthorhombic unit cell. For the sake 
of clarity, we stick to the established unit cell notations and 
indexing of the reflexes here. 



4.2 Possible shifts inside the unit cell 

The scattering intensity is not only sensitive to the con- 
formation of the poly-alanine strands, but also to the dis- 
tance and orientation of different strands relative to each 
other. Besides the question of parallel or antiparallel struc- 
ture, the four strands can also be shifted with respect to 
each other. In principle there are four ways to displace 
them (see Fig. [7]): 

— Shifting strands 1 and 2 in the y-direction by a value 
Ayi2 ■ This displacement is performed in Fig. [8] 

— Shifting strands 1 and 2 in the z-direction by a value 
Az\2- Because of the CH3-groups extending into the 
layers above and below (as seen in Fig. [JJ top panel), 
a displacement like this is only possible, if those two 
strands have a shift in the y-dircction of Ay\2 ~ 

as well. In this case the CH3-groups can pass each other 
without overlapping. 

— Shifting strands 2 and 4 in the z-dircction by a value 
Az ?a. T his shift was originally suggested by Arnott et 
aim). 

— Shifting strands 2 and 4 in the x-direction. However 
this displacement would have a high energy cost, be- 
cause it would break the hydrogen bonds between the 
H- and O-atoms of neighboring strands. Since, more- 
over, no reasonable result could be achieved perform- 
ing such shifts, it will not be included in our discussion 
any further. 

Note that shifting strands 1 and 2 in the cc-direction 
or strands 2 and 4 in the y-direction is associated with 
resizing the unit cell in the x- or y-direction, respectively. 



4.3 Variations between crystallites 

For real systems, the composition of the crystallite is cer- 
tainly not fixed, but may vary from crystallite to cyrstal- 
lite. These variations can easily be included into our model 
introducing a probability distribution P{{r m k}) for the 
crystallite configuration {r mj fc}, with the normalization 
J2{r m fc } P({ r m,k}) = 1- The calculation is performed com- 
pletely analogous to oricntational distribution. The scat- 
tering function becomes 



g(q) 

N 



{r m , k } 



(20) 



(5(q) - 1) 



{r m , k } 



Later (sec. l5.2[) we will investigate the effect of variable 
crystallite sizes as well as the influence of small fractions 
of glycine inside the crystallites. 



5 Results 

5.1 Experimental scattering function 

Two types of samples have been investigated: fiber bun- 
dles and single fiber preparations. Single fibers demand 
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a highly collimated and brilliant beam, but are better 
defined in orientation and are amenable tosimultaneous 
strain-stress measurements. 

An oriented bundle of major ampullatc silk (MAS) of 
Ncphilia Clavipes was measured at the D4 bending mag- 
net of HASYLAB/DESY in Hamburg. The fibers were 
reeled on a steel holder and oriented horizontally in the 
beam. The estimated number of threads was 400-600. Pho- 
ton energy was set to E = 10.9 keV by a Ge(lll) crys- 
tal monochromator located behind a mirror to suppress 
higher harmonics. Data was collected with a CCD x-ray 
camera (SMART Apax, AXS Bruker) with 60 /im pixel 
size and an active area of 62 cm x 62 cm. Illumination was 
triggered by a fast shutter. The momentum transfer was 
calibrated by a standard (corrundum) . Raw data was cor- 
rected by empty image (background subtracted). 

The single fiber experiments have been carried out at 
the microfocus beamline ID13 at ESRF, Grenoble |3ll ). 
A 12.7 keV x-ray beam was focused with a pair of short 
focal length Kirkpatrick Bacz (KB) mirrors (|32l ) to a 7fim 
spot at the sample. This focusing scheme provides a suf- 
ficient flux density (6.8 • 10 15 cps/mm 2 ) to obtain diffrac- 
tion patterns from single dragline fibers. The single fiber 
diffraction patterns were recorded with a CCD detector 
positioned 131 mm behind the sample (Mar 165 detec- 
tor, Mar USA, Evanston, IL). One of the beamline's cus- 
tom made lead beamstops (approx. 300/ito diameter) was 
used to block the intense primary beam. The raw data was 
treated as follows: (i) both the image and the background 
(empty beam) were corrected by dark dark current, and 
(ii) the (empty beam) background was subtracted from 
the image. The peaks which arc significantly broadened by 
the small crystallite size (see below) can then be indexed 
to the orthorombic lattice described above. Typical scat- 
tering distribution for both types of sample preparations 
are shown in fig. [9] as a function of parallel and vertical 
momentum transfer. More details on experimental proce- 
dures and on the sample preparation by forced silking can 
be found in M 



5.2 Scattering function from the model 

It is our aim to determine those crystallites' parameters, 
which best match the experimental result. The free pa- 
rameters of our model are the three shifts Ayi2, Azw and 
AZ24, the unit cell dimensions a x , a y and a z , the crystal- 
lite size in the three directions M x , M y and M z , as well as 
#o, the tilting angle of the crystallites away from the fiber 
axis. 

The parameters of our model affect the scattering in- 
tensity in different ways, which allows us to at least par- 
tially separate the effects of different parameters. The crys- 
tallite size (M x , M y ,M z ) determines the peak widths, whereas 2 In principle there can be an influence because of the 0-tilt 
the length of the principal vectors & x , & y and a z determine of the crysta llites with respect to the fiber axis (see section O 
the peak position. (We have to keep in mind, however, that and fig . gj. However, the scattering amplitude A(q) shows a 
the peak position can differ from the extremal values of discrete peak structure and, for small ^-rotations, the out-of- 
the Laue functions, as explained in sectionESl) The shifts plane reflections (which are influenced by the z-components) 
Ayi2, Az\2 and Az 2 4, as described in Sec. (|4.2p . affect the are too far away to have an impact on the in-plane intensity 
relative peak intensities via the form factors of the unit profile. 



cell, Ylk=i Fk(s0 exp(iqr^). Finally, the parameter 6q is 
responsible for the peak widths in the azimuthal direction 
on the scattering image. 

From Eq. (p~9j) it is clear that the z-components of the 
atom positions {rk}ff = i arc irrelevant for the scattering 
amplitude A(q) in the xy-planc, i.e. if the z-componcnt 
of q is zero. Therefore, parameters affecting only the z- 
components - especially the mentioned shifts in the z- 
direction - will not influence the intensity profile of G(q) 
in the a;y-plane0 Analogously, the scattering profile in the 
z-dircction is independent of parameters influencing the x- 
and y-directions. Consequently, the sections of the scatter- 
ing profile along and perpendicular to the fiber axis can 
be matched to subsets of the parameters separately. The 
intensity profile off the z- and xy-axes, taking into ac- 
count all dimensions of the crystallite, can be seen as a 
consistency check for the found parameters. 

The experimental scattering data clearly reveal a (002) 
peak, Fig. [9] This peak is allowed by symmetry, however it 
is extremel y w eak in the antiparallel structure suggested 
by Marsh (|35T ) and shown in Fig. [TDJ The reason is the 
following: the electron density within the unit cell pro- 
jected along the z-axis is almost uniform, varying by ap- 
proximately 10%. We therefore propose two alternative 
mechanisms generalizing the classical (Marsh) model of 
the antiparallel unit cell. By both mechanisms the inten- 
sity of the (002) peak will increase in agreement with the 
experiment: 

a) the shift of strands 2 and 4 in the z-direction, i.e. a 
nonzero Z\z24-shift or 

b) structural disorder affecting the almost uniform elec- 
tron density. 

We first discuss case a). The uniform electron density 
is disturbed by a shift Az2A 7^ 0. The intensity of the 
(002)-reflection grows accordingly with an increasing shift 
Z\z24- Adjusting the Z\z24-shift yields results consistent 
with experiment. 

In table Q] we present the results for the parameters 
of the model, obtained from optimising the agreement be- 
tween the calculated scattering function and the experi- 
mental one. For comparison we show the set of parameters 
for both, the parallel and the antiparallel structure. On the 
basis of the experimental data, one can not discriminate 
between the parallel and the antiparallel structure. 

The scattering intensities, as calculated with these val- 
ues, are shown in Fig. 1111 The crystallites are randomly 
tilted with respect to the fiber axis, so that on average 
the system is invariant under rotations around the fiber 
axis. Consequently the scattering image also has rota- 
tional symmetry about the z-axis and the q x - and g y -axis 
are indistinguishable and denoted by q xy . A section along 
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presented calculation 


Warwicker (34) 


Marsh (35) 


Arnott (28) 


structure of 


Nephila clavipes 


Bombyx mori 


Tussah Silk 


poly-L- alanine 


alignement 


parallel 


anti-parallel 


anti-parallel 


anti-parallel 


anti-parallel 


a x 


10.0 A 


10.0 A 


10.6 A 


10.6 A 


10.535 A 




9.3 A 


9.3 A 


9.44 A 


9.44 A 


9.468 A 




6.95 A 


6.95 A 


6.95 A 


6.95 A 


6.89 A 


M x 


1.5 W 


1.5 W 








My 


5 


5 








M z 


9 


9 












ay/A 





ay/A 


±a y /A I"' 


Az 12 

















Az 2 4 





-Oz/6 








-a g /10 


Oo 


7.5° 


7.5° 








(u 2 ) 


o.i A 2 


o.i A 2 









Table 1. Summary of parameters. The left two columns show the best match between experimental and calculated scattering 
functions. For a z = 6.95 A, the resulting Ramachandran angles are <1> = —139.0° and = 136.9°. (u 2 ) was used for the Debye- 
Waller factor in eq. (|18[) . The three right columns compare our obtained parameters with the literature. 

Note that each unit cell contains two layers of alanin-strands in s-direction. Therefore M x = 1.5 corresponds to three layers 
of /3-sheets in a single crystallite. 

Statistical model: A layer is shifted by a value +a y /A or —a y /A with respect to the previous layer, where + and — are 
equally likely. 



the (foy-axis is shown in fig. I12| top panel. The mismatch 
for g-values slightly larger than the (120)-peak is plausible, 
because in this region the amorphous matrix contributes 
noticeably to the experimental scattering intensity, but 
has been neglected in the model. The oscillations of the 
calculated scattering image for low q- values are side max- 
ima which arc suppressed by fluctuations in the crystallite 
sizes (see sec. I4.3[) . The corresponding scattering intensi- 
ties are shown in Fig.ll2land Fig. [TU left. Clearly the side 
maxima have been suppressed. 

We now discuss an alternative mechanism to generate 
a stronger (002) peak, e.g. by introduction of disorder in 
the amino acid composition of the unit cell (case b) . Poly- 
alaninc as a model for the crystallites in spider silk is an 
over-simplification, since the amino acid sequence hardly 
allows for a pure poly-alanine crystallite. Instead we ex- 
pect that other residues must be incorporated into the 
crystallite even if energetically less favorable to compro- 
mise the given sequence. In particular, it is highly likely 
that also glycine amino acids are embedded in the crys- 
tallites (|25r ). This can be easily implemented by replacing 
randomly selected alanine amino acids of the crystallites 
with glycine (see sec. 14. 3|) . It is found that the intensity 
of the (002) peak increases with the fraction of substi- 
tuted alanines. In Fig. [10] we compare the original Marsh- 
structure (without gylcine) to a structure with the same 
parameters, but with alanine replaced with glycine ran- 
domly with a probability p g \ = 0.375. The random substi- 
tution has clearly produced an intensity of the (002) peak 
comparable to experiment. 




0.5 1 1.5 2 




0.5 1 1.5 2 



Fig. 12. (Color online) Top: Comparison of a section of the 
experimental (•) and the calculated scattering intensity (-■- 
parallel, -♦- antiparallel). Sections of the profiles in Fig. [11] 
along the q xy -axis, i.e. the scattering profile perpendicular to 
the fiber axis, are shown. Bottom: as top, but with a Gaussian 
distribution (rounded to integers) of the crystallite sizes M„ 
My and M z . The widths are AM X = 2, AM y = 0.75 and 
AM Z = 3 respectively. 



6 Conclusions 

We have developed a microscopic model of the structure 
of spider silk. The main ingredients of the model are the 
following: 



a) Many small crystallites are distributed randomly in an 
amorphous matrix, 

b) the orientation of the crystallites fluctuate with a pref- 
erential alignment along the fiber axis, 
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c) each crystallite is composed typically of 5 x 2 x 9 unit 
cells, 

d) each unit cell contains four alanine strands, constructed 
with Yasara and shifted with respect to each other. 

Disorder can be generated by randomly replacing ala- Appendix 
nine with glycine. 



knowledge financial support by the Deutsche Forschungs- 
gemeinschaft (DFG) through Grant SFB 602/B6. 



We have computed the scattering intensity of our model 
and compared it to wide-angle x-ray scattering data of 
spider silk. Possible inter-crystallite correlations are unim- 
portant, given the measured orientational distribution. In 
other words, even if significant center-of-mass correlations 
between crystallites were present, the orientational distri- 
bution would suppress interference effects, with the excep- 
tion of the (002) peak, which is least sensitive to orienta- 
tional disorder. The contribution of coherent scattering is 
discussed in detail in Appendix [Bj 

A homogeneous electron density background is a nec- 
essary feature of the scattering model. Calculation of the 
crystal structure factor in vacuum does not only lead to 
an incorrect overall scaling prcfactor (which is important 
if absolute scattering intensities are measured), but also 
leads to a scattering intensity distribution with artifacts 
at small and intermediate momentum transfer. 

The comparison between model and data fixes the pa- 
rameters of the unit cell and the crystallite for the two 
possible cases, the parallel and the antiparallel structure, 
respectively, as shown in Table [T] The two models with 
parallel and antiparallel alignment of the alanine strands 
yield comparable agreement with the experimental data. 
Also a refined model in which alanine is randomly replaced 
with glycine give reasonable results. Hence we cannot rule 
out one of these structures. 

Our model is similar to the model of the poly-L-alaninc 
of Arnott et al. (jH). Their model does incorporate a Az 2 4- 
shift. However, our structure shows a better agreement 
with the experimentally measured scattering function us- 
ing a value of Az 2 a = —ciz/6- 

While we have concentrated here on the wide-angle 
scattering reflecting the crystalline structure on the molec- 
ular scale, we note that the same model can be used for 
small-angle scattering to analyze the short range order 
between crystallites in the presence of orientational and 
positional fluctuations. In particular, the model can de- 
scribe the entire range of momentum transfer and the 
transition from wide angle scattering (WAXS) to small 
angle scattering (SAXS). Note that WAXS is usually de- 
scribed only in the single object approximation, neglecting 
intcr-particle correlations. Contrarily, SAXS is mostly de- 
scribed in contiunuum models without crystalline param- 
eters. Here both are treated by the same approach, which 
is a significant advantage for systems where the length 
scales are not decoupled. 
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A Effect of the continuous background 

In this section we explain why it is necessary to include 
the continuous background between the crystallites, intro- 
duced in section [2~4l 

Without the background, the system has vast, unphys- 
ical density fluctuations on the length scale of the crys- 
tallite distances, resulting in a large scattering function 
G(q) for small g-values. As already explained in section 
12.41 these density fluctuations are unphysical, because the 
space between the crystallites is filled with the amorphous 
matrix and water molecules. Fig. [13] shows the the scat- 
tering profiles in xy-direction with and without the con- 
tinuous background. As expected the system without the 
background shows a large increase of the scattering func- 
tion for small g-values. The countinous background, how- 
crver, acts as a low-pass filter on the scattering density 
and therefore annihilates the large intensities for small q. 




Fig. 13. Scattering function in xy-direction with (□) and with- 
out (■) continuous background. Without background, density 
fluctuations on large length scales cause an increase of the scat- 
tering function for small g-values. 



B Relevance of the coherent part of the 
scattering function 

Here we discuss the infulence of the coherent part of the 
scattering function of eq. (fT7|) . Fig. [T4l shows a comparison 
of the incoherent part 



Gi(q) = J VQ\A(Q T q)\ 



(21) 



which is used to calculated the scattering function in this 
paper, and the contribution 



G£(q) := 



J VJZA(]2 T q) 



(22) 
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of the coherent part G 2 (q) = (S(q) - 1) | / VriA(rfq)\ . 

Neglecting the coherent part is plausible for two rea- 
sons. Firstly, because the contribution of G 2 (q) is small 
compared to the incoherent part Gi(q), as seen in the 
figure. And secondly, the length scale for the distances 
between the crystallites is much larger than atom length 
scales investigated here. On length scales we are interested 
in, we expect S(q) m 1, assuming that the crystallite po- 
sitions have no long range order. Therefore, the prefactor 
(5(q) — 1) additionally reduces contribution of the coher- 
ent term. 

The (002)-peak is special for the coherent scattering 
term G' 2 (q). All peaks except for the (002)-peak have a 
very small contribution in G 2 (q) because of the white av- 
erage of the crystallites' rotations about the fiber axis, 
which makes coherent scattering from different crystallites 
less likely, no matter how the crystallites are arranged in 
space. Since there is a preferential alignment of the crys- 
tallites in z-dircction, however, contributions of coherent 
scattering from different crystallites (which arc contained 
in the term G 2 (q)) are not completely destroyed; there- 
fore, if the crystallites' distance in z direction is a multiple 
of the unit cell size a z , causing a large contribution in the 
prefactor (S(q) — 1) at the position of the (002)-peaks, a 
contribution of G 2 (q) would be present. 
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Fig. 7. (Color online) Illustration of one unit cell, containing four neighboring poly-alanine strands. Top: configuration of 
the atoms. Center: Schematic view displaying only the backbone atoms. The arrows illustrate two possible adjustments of the 
structure, which are to be optimzied to match the experimental scattering image. Bottom: Simplified schematic view along the 
backbone axis a z , where © indicates an arrow pointing towards the reader and ® pointing away from the reader. Here the unit 
cell is shown as a gray rectangle. In each case, a parallel structure is shown on the left side, and an antiparallel structure on the 
right side. The illustration shows an unshifted configuration, that means: The C a atoms of neighboring strands are aligned and 
have no shift in z-direction (indicated as dashed line in the middle right image). Furthermore, the strands are exactly aligned 
in the x- and y-direction. 
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Fig. 9. (Color online) Scattering images of spider silk. The fiber axis runs vertical. On top, the colorbar shows scattering 
intensities, which are normalized by the intensity of the (120)-peak. Below the experimental scattering image of spider silk from 
Nephila clavipes is shown, both for bundle measurements (left) and single fiber diffraction (right) 



14 



S. Ulrich et al.: Diffraction from the /3-sheet crystallites in spider silk 



7 






m 











Fig. 10. (Color online) Calculated scattering images of the structure proposed by Marsh et al. (|35f). Unit cell size: (a x ,a y ,a z ) = 
(10.6,9.44,6.95) and we used the crystallite size (M x , M y , M z ) — (1.5,6,9). Left: The crystallites are purely made of alanine 
amino acids. Right: The crystallites' alanine amino acids are replaced with glycine with a probability p s \ — 0.375. 
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Fig. 11. (Color online) Scattering images, as calculated from eq. (|17[l . for the parallel structure on the left side and the 
antiparallel structure on the right side. 
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